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ABSTRACT 

Although chromatin folding is known to be of func- 
tional importance to control the gene expression 
program, less is known regarding its interplay with 
DNA replication. Here, using Circular Chromatin 
Conformation Capture combined with high- 
throughput sequencing, we identified megabase- 
sized self-interacting domains in the nucleus of a 
human lymphoblastoid cell line, as well as in 
cycling and resting peripheral blood mononuclear 
cells (PBMC). Strikingly, the boundaries of those 
domains coincide with early-initiation zones in 
every cell types. Preferential interactions have been 
observed between the consecutive early-initiation 
zones, but also between those separated by several 
tens of megabases. Thus, the 3D conformation of 
chromatin is strongly correlated with the replication 
timing along the whole chromosome. We further- 
more provide direct clues that, in addition to the 
timing value per se, the shape of the timing profile 
at a given locus defines its set of genomic 
contacts. As this timing-related scheme of chroma- 
tin organization exists in lymphoblastoid cells, 
resting and cycling PBMC, this indicates that it is 
maintained several weeks or months after the 
previous S-phase. Lastly, our work highlights that 
the major chromatin changes accompanying PBMC 
entry into cell cycle occur while keeping largely un- 
changed the long-range chromatin contacts. 



INTRODUCTION 

The timely firing of replication origins is essential for most 
eukaryotic cells to allow the proper duplication of the 
genetic material during the S-phase. Replication origins 



are activated according to a specific temporal program 
that has been studied in various species allowing to distin- 
guish sequences replicated in the early S-phase from se- 
quences replicated later (1-15). The replication timing is 
mainly cell-type specific, even if some loci exhibit a strong 
conservation of the replication timing over different cell 
types (8-14,16). The replication timing is significantly 
correlated with genomic and epigenetic features (reviewed 
in (17)). For instance, most of the early-replicated regions 
are gene-rich, transcriptionally active and localized in 
open chromatin, while the late-replicated regions are 
mostly found in silent, gene-poor and closed chromatin 
(2-8,14,18,19). The replication timing thus reflects several 
genomic, epigenetic and functional characteristics. 

Besides hnear features of chromatin, DNA replication 
has been related to the 3D organization of the nucleus. 
The replication occurs at discrete foci in the nucleus ac- 
cording to a specific spatiotemporal organization (20,21). 
Throughout the S-phase, new replication foci de novo 
assemble immediately next to the previously active ones 
(22-25), and a substantial segregation between early- and 
late-replicated loci has been observed (26-29). In addition, 
from the observation that thousands of replication forks 
are only found in hundreds of discrete BrdU-labeled foci, 
it has been proposed that DNA sequences replicating at 
the same time could gather (27,30-33), possibly through 
the anchoring of giant chromatin loops on the nuclear 
skeleton (reviewed in (34-36)). Courbet et al. (37) 
carried out elegant experiments showing that the size of 
the nuclear skeleton-anchored loops clearly depends on 
the velocity of the fork and on the density of initiations. 
They observed that experimental slowing down of rephca- 
tion forks is compensated for by the mobihzation of latent 
replication origins in the subsequent S-phase, and by the 
formation of smaller chromatin loops. This argues in 
favor of a replication-related high-order folding of chro- 
matin. The recent availability of replication timing data in 
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several cell lines now offers the opportunity to test this 
idea more systematically and to address to which extent 
a structural counterpart exists to the functional replication 
domains identified from the replication timing. 3C 
methodologies (38) have emerged as powerful strategies 
to analyze in detail the long-range folding of chromatin 
and by extension, the chromosome organization in the 
nucleus. Different genome-wide mapping strategies have 
been developed (39), including circular chromosome con- 
formation capture (4C) for the highly detailed analysis of 
DNA contacts made by selected loci (40,41), and Hi-C for 
a more global pairwise comparison of contacts between all 
genomic loci (42). 

Hi-C data have already been compared to the replica- 
tion timing of related cell hnes (13,14). Ryba et al. (14) 
observed a tight link between the replication timing and 
the Hi-C correlation matrices (principal eigenvector), 
showing that sequences localized near each other replicate 
at similar time. Moreover, Yaffe et al. (13) numbered more 
interchroniosonial interactions than expected between 
regions having similar replication timing. However, the 
Hi-C data do not always allow the direct visualization of 
the chromatin contacts for chosen loci, principally because 
of insufficient sequencing depth to cover all the combina- 
torial possibilities. Moreover, the normalization per- 
formed in the analysis of Hi-C data (42) gave as much 
weight to the very-distant interactions (over tens of 
megabases) as to the more local contacts (below few 
megabases), despite the fact that the latter are the most 
frequent ones. Essential points thus remain to be ad- 
dressed. First, a closer analysis of the local contacts may 
highhght replication timing-related structural domains in 
the nucleus. Then, a description of the contacts along the 
whole chromosome wiU determine if each locus replicating 
at the onset of the S-phase can be contacted by a given 
early-replicated viewpoint. Lastly, the comparison of the 
contacts in quiescent and cycling cells will address the 
question of their long-term persistence in resting cells. 

Here, we used the 4C methodology (40) in order to 
identify with high precision the main partners of 
multiple early- and late-replicated regions and to deter- 
mine to which extent the 3D organization of chromatin 
can be hnked to the replication timing. For this study, we 
focused on a large region (20 Mb) of the human genome in 
which the replication timing presents large fluctuations 
that are conserved in most examined ceU types 
(Supplementary Figure SI). In particular, these fluctu- 
ations define loci replicating earlier than their immediate 
surroundings (timing peaks), and conversely regions 
replicating latter than the surroundings (timing valleys). 
4C experiments were performed in a lymphoblastoid cell 
line, in resting and in cycling peripheral blood mononu- 
clear cells (PBMC) using 10 viewpoints in the selected 
region. We observed, and confirmed by fluorescent 
in-situ hybridizations (FISH) experiments, that early 
timing peaks coincide with frontiers limiting the spreading 
of strong local interactions and delineating niegabase- 
sized 3D-structural domains. In addition, at larger dis- 
tances, profiles of long-range interactions from timing 
peaks viewpoints faithfully reproduce the replication 
timing profile, which sheds a new light on the propensity 



of early- and late-replicating regions to contact other 
chromosome regions. These results are robustly observed 
in cychng and GO-blocked cefls, indicating that the 
observed organization exists in GO and does not result 
from the ongoing DNA replication. 

MATERIALS AND METHODS 

Cell culture 

A human lymphoblastoid cell line and PBMC were grown at 
37°C with 5% CO2 in RPMI supplemented with 10% fetal 
bovine serum, glutamax and penicillin/streptomycin. PBMC, 
which are mostly lymphocytes (43), were isolated from 
healthy donors using Leucosep® (Greiner bio-one) and 
Ficoll-Paque PLUS (GE Healthcare) according to manufac- 
turer's instructions. PBMC stimulation was performed by 
supplementing the culture medium with 1.5% Phytohemag- 
glutinin (PHA M Form, Life Technology) for 68 h. A million 
stimulated PBMC were fixed with cold ethanol and stained 
with propidium iodide (20 |xg/ml, 30min 37° C) for cell-cycle 
analysis (LSRII, FACSDiva™, BD Biosciences). 

4C protocol and sequencing 

Cells were processed for 4C experiment as previously 
described (40,44,45) with only minor modifications. Cells 
were dispatched into 7 miflions cell ahquots. All ahquots 
were fixed in phosphate buffered sahne 10% SVF and 2% 
PFA (lOmin, RT), lysed in lOmM Tris, lOmM NaCl, 
0.3% NP-40 and dounced (B pestle). The cross-hnked chro- 
matin was digested overnight with 600 U of Hindlll 
(Promega) and ligated with 150U of T4 DNA ligase 
(Fermentas). The templates were further cut with 50 U 
Hinlll (Fermentas) and circularized with 300 U T4 DNA 
ligase (Fermentas). DNA was purified with phenol extrac- 
tion followed by an ethanol precipitation, and chromatog- 
raphy (Roche High Pure PCR Product Purification Kit). 
DNA concentration was measured by fluorimetry 
(Quant-iT'^'^ dsDNA Assay Kit, Broad Range, Life 
Technology). For every primer pair used, 10 PCRs were 
performed on 200 ng input DNA (for primer sequences, 
see Supplementary Table SI). For primer design, we used 
a strategy similar to that described in Splinter et al. (44) and 
Noordermeer et al. (45). PCR products were sequenced 
using a Solexa Genome Analyser IIx or Solexa Hi-Seq 
2000 (76 bp). Reads were first processed to identify the 
viewpoints and then mapped on the human genome 
(UCSC hgl8 assembly) using Seqmap software (46). Only 
reads annealing on Hindlll restriction sites were kept for 
subsequent analyses. 

4C analyses 

The RAW 4C signal consists in the number of reads per 
restriction site. For local analyses, a running mean was 
applied on the RAW signal with a window size of 20 re- 
striction sites. To compare local 4C profiles between cefl 
hnes (Figure IC) the running mean signal was divided by 
the total number of reads corresponding to the chromo- 
some 5. For chromosome-wide and genome-wide 
analyses, we defined a long-range interaction frequency 
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Figure 1. 4C interaction profiles associated with replication timing in lymphoblastoid cells (A) MRT of the lymphoblastoid cell line Gm06990, in the 
portion 150-1 70 Mb of hmnan chromosome 5 (11). Early (respectively, late) replicating sequences correspond to MRT values ~0 (respectively, ~1). 
The arrowheads indicate the position of the viewpoints used for the 4C analysis. (B) Interaction frequencies profiles of the 10 viewpoints. For each 
panel, the light gray curve corresponds to the replication timing profile (right j -axis) and the colored filled curve to the 4C interaction frequency 
profile (left j'-axis). The arrowheads indicate the position of the viewpoints; the thin arrows indicate sharp drops in interaction frequency coinciding 
with early timing peaks; the curly brackets indicate partners of V3 and V4 viewpoints; and the dots indicate preferential partners of P viewpoints 
coinciding with timing peaks. (C) 4C interaction frequency is highly reproducible from one cell type to another. To normalize for differences in read 
numbers, the running mean value (B) has been divided by the total number of reads mapped on chromosome 5 (red = lymphoblastoid, 
green = resting PBMC and blue = PHA-stimulated PBMC). Top, V4 viewpoint; bottom, PI viewpoint. (D) Timing peaks correspond to zones of 
sharp changes in 4C data interaction frequency. (Top) The position of timing peaks are marked along the MRT profile in Gm06990 (vertical bars). 
(Middle) space-scale representation of the 4C interaction frequencies of S- and V-viewpoints. Each horizontal line corresponds to the median value of 
the interaction frequency computed in sliding windows of size ranging from 200 kb to 1.35 Mb. Due to the very large median- value dynamics, a 
logarithmic colormap was used (bottom). 



(LRIF) variable that reinforces the contribution of 
low-abundance densely packed reads. The LRIF signal 
is defined as the percentage of restriction sites covered 
by reads in a 500-kb sliding window (coverage) x the 
mean number of reads in this sliding window. The 
shding step was 100 kb. To compare the LRIF signal 
between cell hues, its value was divided by the total 
number of reads on the chromosome 5. Maxima from 
the LRIF signal were detected using an amplitude thresh- 
old-based algorithm (Matlab, The Mathworks). All plots 
were generated using Matlab. Heat-maps of pairwise 



distances between 4C peaks were generated by quantifying 
the percentage of cases where the distances between the 4C 
peaks are <250kb. The value of this percentage was 
encoded using a green-red colormap. Matlab linkage 
function was used to perform the hierarchical clustering. 

FISH experiments 

We selected PBMC to perform FISH experiments, thus 
avoiding ambiguities due to cells in S or G2 phase exhibit- 
ing replication doublets. Resting PBMC were swollen in 
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75 mM KCl (30min, 37°C), fixed in methanol 3:1 acetic 
acid, and then spread on microscope glass shdes. FISH 
probes were generated by random priming on BAC 
clones (Supplementary Table S2) using amino-allyl 
dUTP. Probes were then coupled with amine-reactive 
alexa fluorophores (A488, A555 or A647, Life 
Technology). The specificity of BAC probes was tested 
by both restriction profile and hybridization on metaphase 
spreads. Hybridization of denatured probes (80°C, 5 min) 
on denatured nuclei (73°C, 2 min) were performed over- 
night at 37°C (in 50% formamide, 2 x SSC, 10% dextran, 
10 |ig single-stranded DNA, 1-3 ^g human CotI). SHdes 
were mounted with fluoromount-G (Southern BioTech) 
and DAPl was used as DNA counterstain. Images were 
collected using a wide-field Zeiss Axioimager (63 x, 
Photometries CoolSNAP HQ camera) and analyzed with 
Image J. The Rayleigh fits of measured distances 

f(x,a) — ^exp^=^^ were performed using Matlab (ct cor- 
responds to the shape parameter). The mean square 
physical distance is defined as (r^) = 2cr', according to 
Yokota et al. (47). 

Mean replication timing profiles and replication 
U-domains 

We used the mean replication timing (MRT) profiles along 
the complete human genome derived by Baker et al. (48) 
from Reph-Seq data (11,15). The signal analysis per- 
formed in (48) led to the segmentation of about half of 
the genome into replication timing domains corresponding 
to U-shaped MRT profiles (late-replicating core sur- 
rounded by two early-replicating loci). MRT profiles 
and U-domains coordinates for Gm06990 and K562 
were obtained directly from the authors. 

Whole-genome Hi-C chromatin conformation data 
analysis 

We used the spatial proximity maps of the human genome 
generated using Hi-C method (42). We downloaded 
100-kb resolution maps for Gm06990 and K562 cell 
lines from the GEO web site (GSE18199 binned 
heatmaps): http://www.ncbi.nlm.nih.gov/geo/query/acc. 
cgi?acc = GSE18199. Interaction counts between loci on 
same chromosome were normalized by the mean 
intrachromosomal interaction count averaged over all 
pairs of loci separated by the same genomic distance. To 
properly analyze the behavior of interaction frequencies 
across timing peaks, we selected weU-defined timing 
peaks that are shared by two consecutive U-shaped repli- 
cation timing domains (48). We then extracted, from the 
Hi-C interaction ratio matrices, two 4C-like interaction 
profiles converging to the shared timing peak: the first 
one coming from the leftmost U-domain border and the 
second one coming from the rightmost U-domain border. 
All 4C-like interaction ratio profiles were finally averaged 
after rescahng the distances so that the U domain size was 
1 (Figure 5, top). 



RESULTS 

We used 4C in combination with high-throughput 
sequencing (4C-seq) (44) to explore the correlation 
between replication timing and chromosome 3D structure. 
4C is used to capture, in the whole genome, the loci inter- 
acting with a chosen sequence, namely the viewpoint. We 
focused on a large (20 Mb) region of human chromosome 
5 in which the replication timing measured in Gm06990 
lymphoblastoid cell line presents large fluctuations and is 
highly conserved in different cell types (Supplementary 
Figure SI). As the replication timing of lymphoblastoid 
cells appears extremely similar, even when comparing es- 
tablished cell hnes from different sources and histories 
(49), the replication timing of the lymphoblastoid ceU 
hne used in this study is very hkely close to the one of 
Gm06990. 

The shape of the timing profile in the selected 20 Mb 
displays timing peaks, i.e. regions replicating earlier than 
their immediate surroundings, and conversely local timing 
valleys (replicating later than their surroundings). 4C 
viewpoints were then unambiguously positioned on some 
local timing peaks or in timing valleys (Figure lA). Five 
viewpoints were selected on early peaks (P1-P5), three in 
very late valleys (VI, V3 and V4), one in a valley of inter- 
mediate timing (mid S-phase, V2) and one in a transition 
from very early to mid S-phase (S). The local chromatin 
and genomic features around the viewpoints are shown in 
Supplementary Figure S2. A mean of 2.6 niilhons reads 
per viewpoint was generated in lymphoblastoid cells 
(Supplementary Table S3). About 76% of the mapped 
reads are located on chromosome 5 and a much lower 
read density was observed in other chromosomes 
(Supplementary Table S4), which is in agreement with 
the existence of chromosome territories. Similar propor- 
tions of intrachromosomal interactions were observed for 
the other ceU types used in the study. Within chromosome 
5, most contacts were local and the interaction frequencies 
rapidly dropped with increasing genomic separation 
(Supplementary Table S3), as previously reported (39). 

Early timing peaks delineate structural self-interacting 
domains and preferentially contact each other 

4C data were locally averaged using a running mean 
within a window of 20 Hindlll sites (~100kb). The 
plots show that interaction frequencies of the valley view- 
points (VI, V2, V3 and V4), as well as of the S viewpoint, 
quickly decrease to a few percents of the maximum signal 
over a few megabases distance from the considered view- 
point (Figure IB). This decrease does not follow a simple 
pattern; indeed, sharp drops can be seen. Strikingly, most 
of these drops seem to be localized at, or close to, timing 
peaks (Figure IB, arrows). This suggests that contacts 
between late-replicating regions are somehow limited by 
intervening timing peaks. To further illustrate the exist- 
ence of these sharp drops in the 4C profiles, we displayed 
4C data using a space-scale representation. Since 4C raw 
signals present strong deviations from the local mean, we 
represented the median value in sHding windows of size 
ranging from 200 kb to 1.35Mb. The resulting images 
were obtained by encoding the median values with a 
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logarithmic color map (Figure ID). It became apparent 
that 4C signal amplitudes are larger within a domain 
bordered by two timing peaks, and that clear transitions 
occur at timing peaks. This analysis confirmed that timing 
peaks delineate frontiers limiting the DNA contacts of 
neighboring late-replicating regions, and thus sheds Hght 
on the existence of self-interacting chromosomal domains 
bordered by timing peaks. Interestingly, beyond this local 
regime, late-replicated loci preferentially interact with 
other late-rephcating regions, as can be seen for the 
long-range interactions of the V3 and V4 valley viewpoints 
(Figure IB, curly brackets). 

The interaction frequencies obtained with viewpoints 
localized on timing peaks (P1-P5) display profiles 
strongly different from those of the valley viewpoints. 
PI, P2 and P4 (and to a lesser extent P3) profiles exhibit 
preferential contacts with nearly every neighboring timing 
peak, even with those located 10-Mb away (Figure IB, 
dots), showing that they loop out the intervening late- 
replicating domains to contact each other (see also the 
space-scale representation in Supplementary Figure S3). 
Although localized on an early peak, the P5 viewpoint 
does not display specific contacts with other timing 
peaks, which might be due to specific features of this view- 
point (see 'Discussion' section). Interestingly, whereas the 
contacts between timing valleys are spread over megabases 
(Figure ID, curly brackets), the contacts between 
early timing peaks are restricted to shorter distances 
(Figure ID, dots), suggesting the existence of a stronger 
and more specific anchoring mechanism between early 
timing peaks than between late regions. 

To address whether this replication-related 3D organ- 
ization of chromatin is also found in non-cychng cells, we 
performed similar analyses in resting PBMC, and, as a 
control, in cychng phytohemagglutinin-stimulated 
PBMC (Supplementary Figure S4). For both ceU types, 
we used the replication timing of Gm06990 cell fine as a 
reference since the timing profile in the region containing 
the viewpoints is remarkably similar in different ceU types 
(Supplementary Figure SI) and since B- and T-ceUs have 
divergent timing on only 4.5% of the genome (49). The 
local 4C profiles obtained with the three different 
cell types remarkably superimpose on each other 
(Figure IC), arguing for the existence of a robust and 
common 3D chromatin conformation. In resting and 
cycling PBMC, we also observed frontiers localized at 
timing peaks that prevent the contacts of neighboring 
late-replicated regions (Figure IC and Supplementary 
Figures S5 and S6). In addition, using timing peaks as 
viewpoints, we noticed preferred contacts with every 
timing peak. This not only confirms the preferential 
contacts between early-replicated regions as previously 
observed (13), but more strikingly also shows that the 
contact frequency profile is strongly correlated with the 
timing profile in the three related cell types. Moreover, 
the fact that the local conformation in resting PBMC is 
similar to the one in cycling cells implies that this correl- 
ation exists in GO and does not result from ongoing DNA 
replication. It suggests that this organization pattern is 
conserved all along the cell cycle. 



LRIF profiles faithfully reproduce the replication 
timing profile 

In 4C experiments, the measured interaction frequencies 
have been shown to rapidly drop with increasing genomic 
distance (50). As a consequence, only few informative 
Hgation events can be captured while analyzing long-range 
contacts. To collect the long-range contacts, it has been 
shown that the contribution of isolated reads improperly 
amplified by PCR should be diminished, and the contri- 
bution of low-abundance densely packed reads reinforced 
(40). Accordingly, we defined a variable called LRIF (see 
'Materials and Methods' section) that operates such a fil- 
tering. While the LRIF signal is synchronous with the 
coverage and the running mean defined at 500 kb 
(Supplementary Figure S7), it offers better signal-to- 
noise ratio, facihtating subsequent analyses. 

AU along the chromosome, the long-range contacts 
obtained from VI, V3 and V4 on the one hand, and 
from PI, P2, P3 and P4 on the other hand, display 
opposed oscillations in the three cell types tested, i.e. P 
viewpoints maxima coincided with V viewpoints minima 
and vice versa (Figure 2A and B and Supplementary 
Figure S8). To quantify this observation for each view- 
point on the whole chromosome 5, we detected long-range 
contacts by local maxima in the LRIF signal, and 
measured the pairwise distances separating each local 
maximum from the closest local maximum of another 
viewpoint. As an illustration of the results, the histogram 
of distances between P4 and PI peaks is centered at zero, 
indicating that the long-range contacts of PI and P4 view- 
points overlap and that PI and P4 viewpoints indeed 
interact with the same partners (Figure 2D). On the 
contrary, the V4-P1 distance distribution is clearly 
depleted around zero, showing that V4 and PI have dif- 
ferent distant partners (Figure 2D). To depict all the 
pairwise combinations, we generated a heat-map 
in which a green-red colormap quantifies overlap of 
long-range contacts identified from LRIF signals 
(Figure 2E). The unbiased hierarchical clustering per- 
formed revealed the existence of three groups. The first 
one corresponds to viewpoints on timing peaks PI, P2, 
P3 and P4 that display LRIF signal in opposition of 
phase with the second group that includes valley view- 
points VI, V3 and V4. The third group corresponds to 
V2, S and P5 viewpoints that display intermediate 4C 
profiles; the hierarchical clustering performed yet tends 
to position them closer to early viewpoints than to late 
ones. Similar results were obtained with the three cell 
fines used in this study, showing the remarkable conserva- 
tion of the long-range interactions frequencies (Figure 2C 
and E). Taken together, we conclude that chromosomal 
domains with opposite replication timing tend to segregate 
in nuclear space, and they do so even in non-dividing qui- 
escent cells. 

We then asked to which extent the long-range partners 
of every viewpoint coincide with timing peaks. We there- 
fore measured, for each viewpoint and on the whole 
genome, the distances between every 4C peaks and the 
closest timing peak. The obtained histograms for 
lymphoblastoid cells (Figure 3A) clearly show that the 
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Figure 2. Chromosomal domains with opposite rephcation timing segregate in the nuclear space. (A) LRIF in lymphoblastoid cells for the 10 
viewpoints (see text and Materials and Methods' section for the deiinitions). The colored filled curves correspond to the 4C profiles in a region of 
chroinosome 5 centered ~60Mb upstream of the viewpoints. The light gray lines correspond to the replication timing in lymphoblastoid cells (11) 
plotted with the same r-axis as in Figure lA. (B) Similar analysis performed with resting PBMC. (C) Comparison of LRIF values of the PI 
viewpoint in lymphoblastoid cells (red), resting PBMC (green) and PHA-stimulated PBMC (blue). LRIF values have been normalized for differences 
in read numbers. The upper panel corresponds to the replication timing in lymphoblastoid cells. (D) Distribution of distances between the 4C peaks 
detected in the LRIF profiles. We detected the 4C peaks and measured the distances between each of them and the closest 4C peak obtained with 
another viewpoint. All pairwise distances were measured over the whole chromosome 5. Three distance distribution histograins are shown here as 
examples (V4-P1, V4-V3 and P4-PI). A distribution centered on zero (as P4-P1) indicates that the LRIF maxima of the two viewpoints overlap. 
(E) To quantify the similarities between LRIF signals from different viewpoints, we calculated the percentage of occurrences where the pairwise 
distances measured in (D) are <250kb. This percentage was encoded with a green-red color map. Similar signals are green (V4/V3 and P4/P1), 
whereas out of phase signals are red (V4/P1). The asterisks correspond to the distribution histograms shown in (D). 
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each of them and the closest tiining peak. The three histograms represent theses distances measured in lymphoblastoid cells for P4 (left), V3 (middle) 
and V2 (right) viewpoints. A distribution centered on zero (P4) indicated that the 4C peaks are close to timing peaks, n = number of 4C peaks 
detected. (B) We computed the percentage of distances <250 kb and represented this value with a green-red color map. 4C signals in phase with the 
replication timing are green (P4) whereas anti-correlated signals are red (V3). 



long-range contacts of the P4 viewpoint, for example, are 
close to timing peaks, as the histogram is centered on zero. 
On the contrary, the histogram for the V3 viewpoint is 
depleted in values around zero indicating that V3 
partners are away from timing peaks. V2 viewpoint 
exhibits an intermediate situation: the distribution histo- 
gram is neither peaked nor depleted around zero. To 
depict this information for aU viewpoints and the three 
cell lines, a green-red colormap was used to show the co- 
incidence between timing peaks and long-range contacts 
(Figure 3B). Our results show that the long-range partners 
of PI, P2, P3 and P4 viewpoints overlap with timing peaks 
for the whole genome. Conversely, VI, V3 and V4 do not 
interact with timing peaks. 

OveraU, we observed that the LRIF profiles correlate 
genome wide with replication timing. Strikingly, they also 
reproduce the replication timing signal over very large dis- 
tances along the chromosome 5, meaning that a given 
timing peak can interact with any other timing peak of 
the whole chromosome 5. Interestingly, similar tendency 
can still be observed on other chromosomes 
(Supplementary Figure S9), albeit with a much lower 
interaction strength (about two orders of magnitude). 
The fact that the interaction frequency profiles tightly 
follow the replication timing strongly argues in favor of 
a replication imprint on chromatin organization. 



In-situ distance measurements confirm the 
physical proximity between loci corresponding to 
early timing peaks 

To test whether some of the contacts captured by the 
4C-sequencing (4C-Seq), which reflect chromatin folding 
at the population level, can be observed in single ceUs, we 
performed FISH in resting PBMC. We selected BAC 
probes localized in the 150-1 70 Mb portion of human 
chromosome 5. Using three colors FISH, we measured 
three different sets of physical distances, a first one (EE) 



corresponding to distances between two local timing 
peaks, a second set (LL) corresponding to distances 
between two timing valleys and a third set (LE) corres- 
ponding to distances between a peak and a valley 
(Figure 4A). Results can be depicted by cumulative histo- 
grams. For example, as predicted by the 4C results, the d5 
(EE) physical distance is smaller than dl7 (LE) even if both 
correspond to a 7.6 Mb genomic length (Figure 4C(3), 
P<IQ~^, Student's Mest with a-risk of 0.05). 
Furthermore, the d3 physical distance (EE) is slightly 
smaller than dl (LE) and d2 (LE) although d3 corresponds 
to a much larger DNA fragment than both dl and d2 
(Figure 4C(1)). Likewise, d28, d29 and d30 present 
similar physical distances although d30 (EE) corresponds 
to a genomic length twice larger than the LE distances d29 
and d30 (Figure 4C(2)). From these three examples, we 
concluded that d3, d5 and d30, which correspond to 
physical distances between timing-peaked regions, are 
smaller than expected if the genomic separation was the 
only parameter determining the 3D distance in the nucleus. 

It is established that the distance histograms can be 
easily fitted by a Rayleigh law (Figure 4D) that only 
depends on a single parameter a (see 'Materials and 
Methods' section). This Rayleigh parameter perfectly de- 
scribes the shape of the obtained histograms and can be 
used to define, for each probe combination, the corres- 
ponding mean square physical distance (47). We plotted 
the mean square physical distance as a function of the 
genomic separation between the probes (Figure 4E). 
Only the dots corresponding to EE distances are labeled 
in Figure 4E, but complete labels and pairwise statistics 
can be found in Supplementary Figure SIO. Among the 
dots deviating from the mean tendency, most correspond 
to peak-to-peak distances (d3, d30, d5, dI8 and to a lesser 
extent d6). In particular, d3 is the smallest physical 
distance, in agreement with 4C results showing that the 
strongest interaction in PBMC occurs between P3 and P4 
peaks (Supplementary Figure S5). This indicates that, for 
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Figure 4. FISH distance measurements confirm the physical proximity between early replicated regions. (A) Physical distances measured by three 
colors FISH in resting PBMC. The arrowheads correspond to the positions of the used probes (Supplementary Table S2), and the line to the 
measured distances. Three sets of measurements were performed: EE between two timing peaks, LL between two timing valleys and LE between a 
timing peak and a timing valley. (B) Example of FISH images in resting PBMC nuclei (dl = GB, d2 = BR, d3 = GR, d28 = BR, d29 = GR, 
d30 = GB). On both chromosomes, the physical distances separating the gravity centers of the spots were measured in PBMC nuclei. (C) Cumulative 
distance measurements. (1) d3 (3 Mb) is slightly smaller than dl (1.9 Mb) and d2 (1.1 Mb, F < 10"^, Student's Mest). (2) d28, d29, d30 have the same 
physical distance even if d30 is 5 Mb and d28/d29 are ~2.5Mb. (3) d5 is smaller than dl7 even if they both cover 7.6 Mb (P < 10"**, Student's /-test). 
(D) Relative frequency histograms of d28 distance (2.5 Mb). The green lines correspond to the fit of the frequency histograms by a Rayleigh law (see 
'Materials and Methods' section). (E) Mean squared physical distance ±95% confidence interval (see 'Materials and Methods' section and (47)) as a 
function of the genomic separation. Red dots, measurement between two timing peaks (EE); blue, between two late regions (LL); green, between a 
peak and a timing valley (LE). Open dots correspond to distances measured inside a single replication domain; close dots correspond to distances 
measured between different replication domains. Here, a replication domain is defined by its two timing peak borders surrounding a late-replicated 
core. P-values shown are evaluated for few EE distances and correspond to pairwise Student's /-tests with other measured distances with similar 
(± 15%) genomic separation. (F) Physical distances and interaction frequencies between two probes localized within the same replication domain or 
between two adjacent replication domains. Top: corresponding distances. Middle: as in (E). Bottom: percentage of cases where measured distances 
are <0.3 |xm. P-values correspond to pairwise Student's /-tests. 
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the same genomic separation, the physical distance 
between two early replicated loci tends to be smaller 
than the distance between a late and an early replicated 
locus. Nevertheless, some early-to-early physical distances 
are not smaller than the mean tendency (d4/dll, dl2 and 
dl5). Therefore, if 4C contacts are not necessarily seen by 
FISH, the FISH results largely agree with those obtained 
in 4C experiments. 

We did not detect spatial proximity between probes in 
timing valleys (Figure 4E). This may be explained by the 
fact that contacts between timing valleys are spread over 
the width of the valleys (Figure IB, curly brackets), sug- 
gesting a more relax interaction pattern between 
late-replicated regions than between timing peaks. This 
may also reflect a relative discrepancy between the estab- 
lishment of contacts (as probed by 4C approach) and the 
physical proximity seen by 2D-FISH. 

The 4C profiles in PBMC indicate that timing peaks are 
found at frontiers limiting the contacts of neighboring 
late-replicated regions (Supplementary Figure S5). To 
test whether this is also observed by FISH, we performed 
measurements around the P4 timing peak. For compar- 
able genomic separation, physical distances measured 
across the timing peak tend to be larger than those per- 
formed in the valley (Figure 4F, middle). This tendency is 
supported by the frequency of colocalization between 
the probes used above in the distance measurements 
(Figure 4F, bottom). Probes separated by the timing 
peak less frequently colocalized than probes located in 
the same timing valley. These observations strengthen 
the conclusion that a timing peak can limit the interaction 
capabilities of a late-replicated sequence with other 
regions which thus contributes to define independent 
megabase-sized self-interacting structural domains in the 
nucleus. 



DISCUSSION 

In the context of DN A replication, the study of structure- 
function relationships in the nucleus remains challenging. 
Indeed, some genomic and epigenetic features are of ut- 
termost importance in the definition of replication origin 
(reviewed in (51)) while replication itself leaves some evo- 
lutionary imprints in the genome (16,52,53). Here, we per- 
formed a series of 4C-Seq and FISH experiments that 
reveal to which extent the 3D conformation of chromatin 
is hnked to the rephcation timing. 

At the local scale (1-2 Mb), that is the typical size of a 
replication timing domain (8,14,48), we found that 
adjacent timing peaks are prone to physically interact, 
by looping out the intervening late-replicated sequences 
(Figure IB and Supplementary Figures S5 and S6). We 
recently proposed a segmentation of about half of the 
genome into U-domains (see 'Materials and Methods' 
section) defined by a late-replicating core surrounded by 
two timing peaks (48). When averaging the Hi-C data 
over these U-domains, we recovered the preferential inter- 
actions between adjacent timing peaks in Gm06990 cells 
(Figure 5, left), which corroborates genome wide our con- 
clusions. Moreover, we also noticed similar behavior 
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Figure 5. Hi-C interaction profile across timing peaks. Average Hi-C 
interaction ratio across timing peaks localized at the border between 
two consecutive U-domains (top, see 'Materials and Methods' section). 
Bottom, the mean interaction ratio profile as a function of the genomic 
distance rescaled so that U-domain size be 1. It corresponds to an 
average over n = 465 (respectively, n = 428) timing peaks in 
Gm06990 (respectively, K562) cell line. 



in the chronic myelogenous leukemia K562 cell fine 
(Figure 5, right), indicating that preferential interactions 
between adjacent timing peaks are not limited to lymphoid 
cell types. Nevertheless, our 4C experiments allow to go 
much further in details since most of the local interactions 
(Figure IB, dots) are not detectable in Hi-C matrices 
(Supplementary Figure Sll). In addition to the preferen- 
tial interaction between adjacent timing peaks, we also 
observed that frontiers limiting the DNA contacts of 
late-replicating regions are found at timing peaks. This 
was observed by 4C (Figure 1 and Supplementary 
Figures S5 and S6) and by FISH (Figure 4F). Since the 
timing peaks delineate boundaries of chromosomal 
domains, we propose to define a structural replication 
domain by its U-shape consisting in a late-replicated 
core surrounded by two local early-replicated peaks. 
Therefore, our definition is complementary to the defin- 
ition of functional domains based on the replication 
program (54). As a support for the existence of these 
domains, we note that the structural frontiers at timing 
peaks significantly coincide with CCCTC-binding factor 
or CTCF (Supplementary Figure Sll) previously shown 
to be enriched at the boundaries of chromatin domains 
(55,56). 

At the 10 Mb scale, we also showed preferential inter- 
actions between timing peaks (Figures IB and 4E and 
Supplementary Figures S5 and S6). Similarly, late- 
replicated viewpoints (V3, V4) preferentially interact 
with timing valleys. However, their interactions are 
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spread over almost the entire length of the valley, 
indicating a relative lack of specific partners (Figure IB 
and Supplementary Figures S5 and S6). This might 
explain why we did not detect any physical proximity by 
FISH between late-replicated sequences (Figure 4E). 

At the scale of entire chromosomes, we demonstrated 
that chromosomal domains with an opposite replication 
timing are separated in the nuclear space (Figure 2E) and 
that an early-replicated (respectively, late-replicated) locus 
interacts with other timing peaks (respectively, timing 
valleys; Figures 2A and B and 3). Previous works that 
compared Hi-C data and replication timing values, 
already proposed that synchronously replicated sequences 
interact more frequently than expected (13,14). However, 
the 4C protocol provides a much higher number of reads 
for each viewpoint, thereby aUowing the direct detection 
of chromosome-wide interactions. The 4C signals of PI, 
P2 and P4 viewpoints faithfully reproduce the replication 
timing along the whole chromosome 5, which demon- 
strates the astonishing persistence, all over the chromo- 
some, of a timing-linked long-range conformation of 
chromatin. In addition, our analysis showed that the 
niid-S-phase replicated P4 peak has a long-range 
interactions profile similar to those of PI, P2 and P3 
(Figure 2A and B and Supplementary Figure S8). This 
means that it is probably not the precise replication 
timing value at a given peak that specifies the long-range 
contacts but rather the timing value relative to its 
surroundings. The shape of the timing profile, with its 
local maxima, therefore better predicts the chromatin 
contacts than the fact of being replicated at the onset of 
the S-phase. This indicates that, irrespective of its replica- 
tion timing value, a timing peak may comprise determin- 
ants governing the structural conformation of chromatin. 

This spatial segregation between early- and late- 
replicated domains is in agreement with previous results 
showing the relative absence of interactions between active 
(early-replicated) and inactive (late-replicated) chromatin, 
both by 3C-derivated methodologies (40,42,57) and cyto- 
logical analyses (26-29). The nuclear processes explaining 
this separation remain to be identified. Insight may come 
from the P3 viewpoint that is localized in transcriptionally 
silent chromatin, ~100-kb away from the nearest ex- 
pressed gene in PBMC. P3 exhibits in this cell type a 
positive correlation with the timing, albeit slightly lower 
than the one measured for PI, P2 and P4 viewpoints that 
are localized within expressed genes (Figure 3 and 
Supplementary Figure S2). This might indicate that the 
influence of transcription and replication probably adds 
up to enhance the long-range interaction capabilities of 
selected locus, maybe through the recruitment to spatially 
coordinated replication and transcription factories (58). 
This hypothesis is further sustained by the results 
obtained for P5. This viewpoint lies in a weakly expressed 
gene (Supplementary Figure S2) and is not located on a 
sharp timing peak (350-kb shoulder in the replication 
timing curve; Figure 1). In that context, we assume that 
the contribution of both transcription and replication are 
weaker, explaining why P5 interactions with timing peaks 
cannot be detected beyond the neighboring P4. 



Our analysis revealed another striking property of chro- 
matin reorganization while lymphocytes exit quiescence. 
Quiescent cells and cycling cells are known to have 
distinct nuclear organization (59-61). For instance, 
lymphocyte activation is associated with major changes 
in gene expression (62-66) and major histological modifi- 
cations of the nucleus with a doubhng of nuclear volume 
and a significant reduction of peripheral heterochromatin 
(67,68). Despite these important modifications of nuclear 
architecture, a high similarity in local and long-range 
chromatin contacts was observed between resting and 
activated PBMC (Figures IC and 2C). As PBMC ceUs 
were efficiently pushed out of dormancy (Supplementary 
Figure S4), the absence of differences between resting and 
activated lymphocytes does not result from an inefficient 
PBMC activation by PHA. This implies that the histolo- 
gical disappearance of heterochromatin during lympho- 
cyte activation takes place without disruption of the 
4C-probed long-range interactions and while most of the 
neighboring relationships are maintained. The stability of 
these interactions may rely either on constant structural 
elements ultimately determined by the genome sequence, 
either on some nuclear function shared by resting and 
cychng cells. As a support of the latter, it has been 
shown that the quiescent CI 27 ceUs (mouse mammary 
epithelial cells), when in vitro replicated, retain a replica- 
tion timing program similar to the one observed in cychng 
C127 cells (61). 

In summary, using 4C-Seq experiments, we demonstrate 
that the spatial segregation of early- and late-replicating 
loci (13,14,26-29) is associated with the establishment of 
robust chromatin contacts. The 4C interaction profiles, 
supported by FISH, highhght the existence of 
megabase-sized self-interacting domains in the nucleus 
whose boundaries coincide with early-initiation zones. 
Along the whole chromosome, we also directly observed 
a considerable correlation between the replication timing 
profiles and the long-range contact frequencies of every 
viewpoint, which could not be observed in previous 
studies (13,14). Moreover, our work shows that it is not 
the precise replication timing value that specifies the 
long-range contacts but rather the timing value relative 
to its surroundings. This chromatin organization scheme 
is shared by cycling and resting PBMC. This replication- 
associated structure stably observed in quiescent as well as 
in activated cells likely reflects some fundamental elements 
of the chromosome 3D structure that remain to be 
determined. 
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